An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments
نویسندگان
چکیده
We present a method for localizing and separating sound sources in stereo recordings that is robust to reverberation and does not make any assumptions about the source statistics. The method consists of a probabilistic model of binaural multisource recordings and an expectation maximization algorithm for finding the maximum likelihood parameters of that model. These parameters include distributions over delays and assignments of time-frequency regions to sources. We evaluate this method against two comparable algorithms on simulations of simultaneous speech from two or three sources. Our method outperforms the others in anechoic conditions and performs as well as the better of the two in the presence of reverberation.
منابع مشابه
Models and Algorithms for Interactive audio Rendering
Realistic modeling of reverberant sound in 3D virtual worlds provides users with important cues for localizing sound sources and understanding spatial properties of the environment. Unfortunately, current geometric acoustic modeling systems do not accurately simulate reverberant sound. Instead, they model only direct transmission and specular reflection, while diffraction is either ignored or m...
متن کاملTwo-Microphone Spatial Filtering Improves Speech Reception for Cochlear-Implant Users in Reverberant Conditions With Multiple Noise Sources
This study evaluates a spatial-filtering algorithm as a method to improve speech reception for cochlear-implant (CI) users in reverberant environments with multiple noise sources. The algorithm was designed to filter sounds using phase differences between two microphones situated 1 cm apart in a behind-the-ear hearing-aid capsule. Speech reception thresholds (SRTs) were measured using a Coordin...
متن کاملStatistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array
It is very important for a hands-free speech interface to capture distant talking speech with high quality. A microphone array is an ideal candidate for this purpose. However, this approach requires localizing the target talker. Conventional talker localization methods in multiple sound source environments not only have difficulty localizing the multiple sound sources accurately, but also have ...
متن کاملLow latency localization of multiple sound sources in reverberant environments.
Sound source localization algorithms determine the physical position of a sound source in respect to a listener. For practical applications, a localization algorithm design has to take into account real world conditions like multiple active sources, reverberation, and noise. The application can impose additional constraints on the algorithm, e.g., a requirement for low latency. This work define...
متن کاملTracking Multiple Acoustic Sources in Reverberant Environments
This paper concerns the problem of tracking acoustic sources in reverberant environments by using a particle filter. The localization problem is transformed into the retrieval of the unobservable state of a dynamical model through noisy measures. Though effective, two problems are related to particle filter: the degeneracy phenomenon (all particles but one are not significative) and the loss of...
متن کامل